Time Series Regression with Meta-Clusters
نویسنده
چکیده
This paper presents a preliminary attempt to apply classification of time series using meta-clusters in order to improve the quality of regression models. In this case, clustering was performed as a method to obtain subgroups of time series data with normal distribution from the inflow into wastewater treatment plant data, composed of several groups differing by mean value. Two simple algorithms, K-mean and EM, were chosen as a clustering method. The Rand index was used to measure the similarity. After simple meta-clustering, a regression model was performed for each subgroups. The final model was a sum of the subgroups models. The quality of the obtained model was compared with the regression model made using the same explanatory variables, but with no clustering of data. Results were compared using determination coefficient (R2), measure of prediction accuracymean absolute percentage error (MAPE) and comparison on a linear chart. Preliminary results allow us to foresee the potential of the presented technique. Keywords—Clustering, Data analysis, Data mining, Predictive models.
منابع مشابه
Ensemble Kernel Learning Model for Prediction of Time Series Based on the Support Vector Regression and Meta Heuristic Search
In this paper, a method for predicting time series is presented. Time series prediction is a process which predicted future system values based on information obtained from past and present data points. Time series prediction models are widely used in various fields of engineering, economics, etc. The main purpose of using different models for time series prediction is to make the forecast with...
متن کاملA Hybrid Time Series Clustering Method Based on Fuzzy C-Means Algorithm: An Agreement Based Clustering Approach
In recent years, the advancement of information gathering technologies such as GPS and GSM networks have led to huge complex datasets such as time series and trajectories. As a result it is essential to use appropriate methods to analyze the produced large raw datasets. Extracting useful information from large data sets has always been one of the most important challenges in different sciences,...
متن کاملScaling relations in dynamical evolution of star clusters
We have carried out a series of small scale collisional N-body calculations of single-mass star clusters to investigate the dependence of the lifetime of star clusters on their initial parameters. Our models move through an external galaxy potential with a logarithmic density profile and they are limited by a cut-off radius. In order to find scaling relations between the lifetime of star cluste...
متن کاملStock Price Prediction using Machine Learning and Swarm Intelligence
Background and Objectives: Stock price prediction has become one of the interesting and also challenging topics for researchers in the past few years. Due to the non-linear nature of the time-series data of the stock prices, mathematical modeling approaches usually fail to yield acceptable results. Therefore, machine learning methods can be a promising solution to this problem. Methods: In this...
متن کاملTime Series as a Point - A Novel Approach for Time Series Cluster Visualization
of temporal data and finding temporal patterns, regularities, trends, clusters in sets of temporal data. Wavelet transform provides a means to analyze a temporal data at multiple resolutions. In this paper we propose a methodology for representing a time series as histograms at different resolutions using wavelet transform. Then we fit a regression line on the cumulative histogram and express t...
متن کامل